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(57) ABSTRACT 

A method and apparatus for graphics processing that utilizes 
multiple graphics processors in parallel is presented. A 
primary graphics processor is operably coupled to a primary 
memory that includes a primary color buffer and a primary 
Z buffer. The primary processor processes a first portion of 
the image data for a frame, where processing the first portion 
stores color data in the primary color buffer and Z data in the 
primary Z buffer. A secondary processor is operably coupled 
to a secondary memory that includes a secondary color 
buffer and a secondary Z buffer. The secondary processor 
processes a second portion of the image data for the frame. 
The processing of the second portion of the image data 
results in color data being stored in the secondary color 
buffer and Z data being stored in the secondary Z buffer. The 
display signal that results in the image data for the frame 
being displayed is generated by a display driver included in 
the primary processor. In one embodiment, the display 
driver retrieves all of the color data used to generate the 
display signal from the primary color buffer. As such, the 
secondary processor transfers the color data for the second 
portion of the frame from the secondary color buffer to the 
primary color buffer to facilitate generation of the display 
signal. This data transference preferably occurs utilizing 
direct memory access (DMA) transfers that may be initiated 
during the vertical blanking interval portion of the display 
signal. 

24 Claims, 4 Drawing Sheets 
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METHOD AND APPARATUS FOR GRAPHICS FIG. 2 illustrates a block diagram of another graphics 

PROCESSING USING PARALLEL GRAPHICS processing circuit in accordance with a particular embodi- 

PROCESSORS ment of the present invention; 

FIG. 3 illustrates a block diagram of yet another graphics 

FIELD OF THE INVENTION 5 P rocessm S circuit in accordance with a particular embodi- 
ment of the present invention; 

The invention relates generally to graphics processing and fig. 4 illustrates a flow diagram of a method for graphics 

more particularly to a method and apparatus for graphics ssi in accordance wit g a parlicu i ar embodiment of 

processing using parallel graphics processors. f, * • j 

the present invention; and 

BACKGROUND OF THE INVENTION 10 FIG. 5 illustrates a flow diagram of another method for 

Computers are used in many applications. As computing graphics processing in accordance with a particular embodi- 

systems continue to evolve, the graphical display require- ment of the present invention, 

ments of the computing systems become more demanding. DETAILED DESCRIPTION OF THE 

This is especially true ,n apphcauons where delated three- PREFERRED EMBODIMENT OF THE 

dimensional graphical displays must be updated quickly. 15 TwvpisrnnM 
One example of such an application is a computer game 

where movement and modification of numerous images on Generally, the present invention provides a method and 

the screen may place great demands on the processing power apparatus for graphics processing that utilizes a primary and 

of the computing system. a secondary graphics processor. The primary graphics pro- 

In a typical video graphics processing system, a single 20 cessor is operably coupled to a primary memory that 

graphics processor utilizes a frame buffer that stores color includes a primary color buffer and a primary Z buffer. The 

and Z information for each of the pixels that make up the primary processor processes a first portion of the image data 

display frame. The graphics processor renders images into for a frame, where processing the first portion stores color 

the frame buffer, where the rendering may include compar- data { n fo c primary color buffer and Z data in the primary Z 

ing color and Z values of received or generated image data buffer. The secondary processor is operably coupled to a 

with that already stored in the frame buffer. Once rendering SCC ondary memory that includes a secondary color buffer 

of the display image has completed in the frame buffer, the md a ^0^3^ z buffer . secondary processor processes 

color data stored in the color portion ot the irame buffer is a sec(md i(m of ^ - c 

data for the frame. The 

read out and usee 1 to generate the disp ay signal that is ssi of ^ second rtion of the im data results m 

provided to the display. As the complexity of images i j * i_ ■ * j • *i_ j lure j rr 

increases and the processing operations corresponding to 30 «dor data being stored lin the secondary ^color buffer ^and Z 

those images also increases, the processing requirements data store f >? the secondary Z buffer. The display 

may exceed the processing limitations of such graphics "Wf 1 that . results m } h * image data for the frame being 

processors displayed is generated by a display dnver included in the 

One technique that can increase the graphics processing P rimar y Processor. In one embodiment, the display driver 

bandwidth in a system is to employ multiple processors to 35 retrieves all of the color data used to generate the display 

perform the graphics processing operations. Prior art sys- si 8 nal from &e Primary color buffer. As such, the secondary 

terns that incorporated multiple processors to perform the processor transfers the color data for the second portion of 

graphics processing, for a single display have suffered from ^ frame flom me secondary color buffer to the primary 

a number of drawbacks that reduce their efficiency of buffer to facilitate generation of the display signal, 

operation and cost effectiveness. 40 ^ data transference preferably occurs utilizing direct 

One such prior art system employed two processors where ™ m0J ?. a< f ff <-™ A ) * rans f els ma /, |» ™ ti f d durin 8 

each processor has its own local memory for performing the vertlcal blankul 8 mterval P ortlon of the dls P la y 

graphics processing operations. Each graphics processor B V ut,hzm 8 ^ ot more processors to cooperatively 

produced a display stream based on the portion of the P rocess «•» ™ a e e data for a frame ' additional processing 

display frame that it was responsible for processing. An 45 bandwidth can be leveraged to perform complex graphics 

external combination block was then used to combine the processing operations. Because each of the processors is 

display stream portions generated by each of the graphics c0U P led 10 lts own local memory, the processing operations 

processors to form a composite display stream. mat typically require multiple reads and writes to such 

Unfortunately, such prior art systems implement the display memor y can be performed in an efficient manner, 

driver, which typically includes a digital to analog converter 50 However, once the processing operations are complete and 

for generating an analog display signal, as a separate com- me ima S e data ,0 be displayed is stored in the respective 

ponent or integrated circuit. This increases the cost associ- local memories, the image data can be combined into a 

ated with such an implementation. Another problem is the sm ^ e mem0T V and read out b y a dls P la y dnver t0 g enerate 

requirement for a high level of synchronization between the me dB P la y S1 gnal. 

pixel clocks corresponding to each of the graphics proces- 55 The invention can be better understood with reference to 

sors. This additional requirement further increases costs and FIGS - !- 5 - mG - 1 illustrates a graphics processing circuit 

reduces the potential for implementing the system in a 1°° mat includes a primary processor 110, a secondary 

commercially viable manner. processor 150, a primary memory 120, and a secondary 

Therefore, a need exists for a method and apparatus for memory 1 *°-.? e , primary and . secondary processors may 

graphics processing that utilizes two or more graphics «° each be individual integrated circuits. Each of the primary 

processors in parallel in a manner that does not require an and secondary processors 110 and 150 preferably include 

external display driver circuitry that makes up a three-dimensional graphics ren- 
dering pipeline. Thus, the rendering block 112 and the 

BRIEF DESCRIPTION OF THE DRAWINGS rendering block 152 are preferably capable of rendering 

FIG. 1 illustrates a block diagram of a graphics processing 65 three-dimensional images, 

circuit in accordance with a particular embodiment of the The primary memory 120, which is operably coupled to 

present invention; the primary processor 110, includes a primary color buffer 
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122 and a primary Z buffer 124. The primary processor transferred from the secondary color buffer 162 into the 

processes a first portion of image data for a frame, where the primary color buffer 122. 

processing operations for the first portion store color data for ^ data trarJ sfer indication provided to the secondary 

the first portion in the primary color buffer 122 and Z data processor 150 may be relayed using a dedicated signal line 

for the first portion in the primary Z buffer 124. Preferably, s mat jg operab ly CO upled to the primary and secondary 

the primary processor 110 recognizes the relevant portions ^ In ^ £ mbo diments, me primary processor 

of graphics primitives that it receives, where relevant por- i 1ft J(L , * A . i . 

tioS are IhL portions of the primitives that effect pixels 110 md *° ^ndary processor 150 may be intercoupled 

included in the first portion of the frame. usln S a shared bus st ™ cture 172 > wh « re A * ared b " s 

™ „. t; j • • structure couples the primary processor 110 and the second- 

The processmg operauons performed by the primary „ a ^ centr i P p rocess ing unit (CPU) 180 of 

processor 110 with regard to the first portion of the image J r . A T r , <? . V. ■ 

data for the frame include rendering operations that may be processing system. In one embodimen the primary 

c j L <u j - liiii^ou j • processor 110, the primary memory 120, the secondary 

performed by the rendering block 112. Such rendering r . ^ A ' *_ J , J J 

r . , j u - • a memory 160, and the secondary processor 150 may be 

operations include receiving graphics primitives and gener- . , , * . ,. J K « . 

f. ic * u- u *u t • j • . i - , included in a video graphics expansion card that is used in 

a ting pixel fragments which are then combined with stored 1<; . . T . ... ... . , 

. , ^ . . .r , , „ ... „ 15 a personal computer. In such an embodiment, the central 

pixel data in the primary color buffer 122 and the primary Z r % DA 4 iL , . ' r it _ 

u a: a * ( 4 L £ 1* f .u c * r processing unit 180 represents the central processor of the 

buffer 124 to generate the final image for the first portion of r ♦ u • u • ui i j * 1 nn -m. 

,i j • i ^ „ , t . , & , F j ■ computer, which is operably coupled to memory 190. The 

the display. The Z data is the depth coordinate corresponding 4 i • ^ on . a. * j ■ 

t . . ■ « . .i * < . , ,\ ° central processing unit 180 may execute software stored in 

to a pixel that is used in three-dimensional graphics pro- 1Qft ,k n , ■„ t u fl p .r«v.^, 

- 4 j * • -r i i t i-- the memory 190 that results in the generation ot different 

cessing to determine if a particular portion of a graphics ^ ^ e& ^ ^. j a ° 

primitive is visible. The Z data is used to determine which ^ ra ^ ICS ima £ es 01 ls P av * 

primitive portions he behind portions of other primitives and ^ shared bus structure 172 may include a bridge circuit 

are therefore not visible to a viewer. The use of such Z 170 as shown in FIG. 1, where the bridge circuit 170 couples 

values and three-dimensional rendering operations are well the P rimarv and secondary processors 110 and 150 to the 

known in the art central processing unit 180. Note that additional processors, 

The secondary' memory 160 includes a secondary color memo ^ 01 ^circuits may also be coupled to the shared 

buffer 162 and a secondary Z buffer 164. The secondary bus structure 172 

memory 160 is operably coupled to the secondary processor [n me case where the graphics processing circuit 100 

150, and the secondary processor 150 processes a second includes the central processing unit 180, the data transfer 

portion of the image data for the frame. Preferably, the first 30 indication may be generated by the primary processor U0 or 

and second portions of the image data make up the complete b y the central processing unit 180. Preferably, the data 

set of image data for a frame. As the secondary processor transfer indication is generated based on the display signal 

150 processes the second portion of the image data for the 118 sucb mat me data transfer occurs during a portion of the 

frame, it stores color data for the second portion in the display signal 118 in which data is not being actively fetched 

secondary color buffer 162 and stores Z data for the second 3S from me primary color buffer 122. In one embodiment, the 

portion in the secondary Z buffer 164. The processing data transfer indication is generated based on a vertical 

operations performed by the secondary processor 158 blanking interval (VBI) portion of the display signal. The 

include rendering operations that may be performed by the vertical blanking interval corresponds to the point in the 

rendering block 152 that is included in the secondary display signal where the scanning device has reached the 

processor 150. 40 bottom of the display and is being reset to the top of the 

Because the secondary processor 150 only processes display. This represents a point during which the display 

image data for the second portion of the frame, the second- si S nal 118 ^ not delivering pixel color data to the scanning 

ary processor 150 preferably recognizes and only acts upon device. As such, the bandwidth of the primary memory 120 

image data that is relevant to the second portion of the frame. 18 available for receiving data from the secondary color 

Thus, if a primitive is received that includes pixel coordi- 45 bu ^ er 1^2. 

nates that only correspond to pixels included in the first In order to facilitate efficient and timely transfer of the 

portion of the frame, the secondary processor 150 will color data from the secondary color buffer 162 to the 

disregard such a primitive. If a primitive is received that primary color buffer 122, direct memory access (DMA) 

includes pixel coordinates that correspond to both the first transfers may be utilized. DMA transfers are well under- 

and second portions of the frame, each of the primary and 50 stood in the art. The DMA transfers may be performed by 

secondary processors 110 and 150 will process the primitive DMA controllers 114 and 154 that are included in the 

at least to the point where the relevant pixels for that primary processor 110 and the secondary processor 150, 

primitive included in their respective portions are processed. respectively. Each DMA controller preferably generates one 

The secondary processor 150 is operably coupled to the of me source and destination addresses required for the 

primary processor 110 in a manner such that it is capable of 55 DMA transfers. Thus, if color data is being transferred from 

receiving a data transfer indication from the primary pro- tne secondary color buffer 162 into the primary color buffer 

cesser 110. The data transfer indication instructs the sec- 122, the DMA controller 154 generates the source addresses 

ondary processor 150 to transfer the color data stored for the used 10 fetch & & color data from the secondary color buffer 

frame in the secondary color buffer 162 to the primary color 162, ^d the DMA controller 114 generates the destination 

buffer 122. In the embodiment illustrated in FIG. 1, the 60 addresses used for storing the data being transferred into the 

display driver 116 of the primary processor 110 generates primary color buffer 122. 

the display signal 118 by fetching color data from the The DMA controllers 114 and 154 are intercoupled to 

primary color buffer 122. Preferably, the display driver 116 facilitate the transfer of color data, and may be intercoupled 

includes a digital to analog convert such that an analog by the DMA bus 130. When the DMA bus is used for the 

display signal can be generated. Thus, in order to generate 65 DMA transfers, the DMA bus 130 may include at least one 

a display signal 118 that encompasses the entire frame, the data signal and a clock signal. More preferably, the DMA 

color data for the second portion of the frame must be bus 130 may include multiple data signals along with the 
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clock signal, and in one embodiment, eight data signals and 
a clock signal may be included in the DMA bus 130. In order 
to provide a fast means of data transfer, the data portion of 
the DMA bus 130 may be "double clocked", such that valid 
data is present on the data portion of the DMA bus 130 at 5 
both the rising and falling edges of the clock portion of the 
DMA bus. Maintaining a limited number of data signals on 
the DMA bus 130 can help to reduce the number of pins 
required for packaging of the primary processor 110 and the 
secondary processor 150 when each is manufactured as an 10 
individual integrated circuit 

DETAILED DESCRIPTION OF EMBODIMENT 
OF INVENTION 

In other embodiments, the secondary processor 150 may 15 
include a DMA controller that transfers the color data for the 
second portion of the frame to the primary color buffer via 
the shared bus structure 172 or through some other link to 
the primary memory 120. In such an embodiment, the DMA 
controller included in the secondary processor 150 may 20 
perform all of the necessary addressing for the DMA trans- 
fers. Thus, the DMA controller within the secondary pro- 
cessor 150 may generate both the source addresses corre- 
sponding to the secondary color buffer 162 in the secondary 
memory 160 and the destination addresses corresponding to 25 
the primary color buffer 122 in the primary memory 120. 

Although block DMA transfers are preferable for trans- 
ferring large amounts of color data from the secondary 
memory 160 to the primary memory 120, DMA or block 3Q 
transfers are not mandatory for operation of the system 
illustrated in FIG. 1. In some embodiments, the primary 
processor 110 issues individual memory read requests to the 
secondary processor 150 which facilitates fetching of the 
required data from the secondary memory and relays the 35 
data back to the primary processor 110. 

The division of the processing responsibilities with 
respect to the frame may be performed in a variety of ways. 
In one embodiment, the first and second portions essentially 
separate the frame in halves. Thus, the first portion may 40 
include the left half of the frame and the second portion the 
right half of the frame. In other embodiments, the division 
may not be equal, which may be advantageous in instances 
where either the primary or secondary processor 110 or 150 
has more available processing bandwidth than the other. In 45 
some embodiments, the division of processing responsibili- 
ties may be performed in a dynamic manner such that the 
first and second portions of the image data for the frame are 
dynamically determined. For example, the second processor 
150 may be actively performing other functions and only 50 
utilized as a reserve processor when the primary processor 
110 becomes overloaded. 

Because the secondary processor 150 is only processing a 
portion of the image data for the frame, the secondary 
memory 160 need only allocate storage locations for those 55 
pixels for which the secondary processor 150 is responsible. 
Thus, if the secondary processor 150 processes the right half 
of the frame, the secondary memory 160 need only allocate 
memory locations for pixels included in that half of the 
frame. Thus, the secondary color buffer 162 will include a $q 
corresponding pixel location for each pixel included in the 
second portion of the frame. Similarly, the secondary Z 
buffer 164 will only include a corresponding pixel location 
for each pixel included in the second portion of the frame. 

In the circuit 100, the primary memory 120 is responsible 65 
for storing the entire set of color data for the frame in the 
primary color buffer 122. This is because the primary color 
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buffer 122 is relied upon for retrieving the color data utilized 
for generating the display signal 118. As such, the primary 
color buffer 122 must include a corresponding pixel location 
for each pixel included in the frame. However, because the 
primary processor 110 is only responsible for performing 
processing operations on the first portion of the frame, the 
primary Z buffer 124 need only include a corresponding 
pixel location for each pixel included in the first portion. 
This is because the Z data for each pixel location is only used 
in rendering operations, and is not required for generating 
the display signal 118. The display signal 118 is generated 
from color data stored in the primary color buffer 122 and 
does not require any Z information. 

Note that in some embodiments, the transfer of color data 
from the second memory 160 to the primary memory 120 
may not be accomplished rapidly enough to permit the 
primary color buffer 122 to be utilized for both storage of 
incoming transferred data and for sourcing the color data 
required for generation of the display signal 118 by the 
display driver 116. In such cases, the primary memory 120 
may include a first color buffer and a second color buffer 
where color data is transferred from the secondary memory 
to the first color buffer when the second color buffer is used 
as a display color buffer, and where the color data is 
transferred to the second color buffer when the first color 
buffer is used as the display color buffer. The display driver 
116 generates the display signal from color data stored in 
whichever of the two color buffers is currently the display 
color buffer. Whichever of the first and second color buffers 
is not currently acting as a display buffer acts as the primary 
color buffer in that the processing operations performed by 
the primary processor 110 store color data in the primary 
color buffer, and data is transferred from the secondary 
memory 160 into the primary color buffer. 

As is apparent to one of ordinary skill in the art, the use 
of multiple processors can be extended to include more than 
two processors where DMA transfers or other means of data 
transfer enable rendered color data to be combined for 
generation of a display signal. Similarly, each of the primary 
and secondary processors 110 and 150 may include addi- 
tional functionality that utilizes the capabilities of the DMA 
controllers 114 and 154 to promote data exchange required 
for these additional capabilities. One such example is illus- 
trated in FIG. 2. 

FIG. 2 illustrates an alternate graphics processing circuit 
200 that includes a primary memory 220, a secondary 
memory 260, a primary processor 210, and a secondary 
processor 250. As was the case with the graphics processing 
circuit of 100 of FIG. 1, the graphics processing circuit 200 
of FIG. 2 may include a shared bus structure 172, bridge 
circuit 170, central processing unit 180, and memory 190. 
Each of the primary and secondary processors 210 and 250 
preferably includes three-dimensional graphics processing 
capabilities as described with respect to the processors 110 
and 150 of FIG. 1. 

The graphics processing circuit 200 is similar to the 
graphics processing circuit 100 with the additional capabil- 
ity that the secondary processor 250 generates an additional 
display signal 258 using a built-in display driver 256. 
Preferably, the display driver 256 includes a digital analog 
converter such that an analog display signal can be gener- 
ated. In the embodiment illustrated in FIG. 2, each of the 
primary and secondary processors 210 and 250 is respon- 
sible for generating a display signal for a separate frame. 
Portions of the processing responsibilities for different pixel 
locations in each of these two frames may be divided 
between the primary and secondary processors 210 and 250 
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such that the primary processor 210 may perform some of unit 180 may be through a shared bus structure 172 that may 

the processing functions for the frame data that results in the include a bridge circuit 170. 

display signal 258 and the display signal 218, and the [ n the embodiment illustrated in FIG. 3, the secondary 
secondary processor 250 may perform some of the process- processor 350 processes a second portion of the image data 
ing functions for the frame data used to generate the display 5 f or the frame. The secondary processor 350 utilizes the 
signal 218 and the display signal 258. secondary memory 360 as local memory. Thus, the process- 
In order to facilitate this division of processing ing operations performed by the secondary processor 350 
responsibilities, the primary memory 220 is shown to store color data in the secondary color buffer 362 and store 
include a primary color buffer 222 and a primary Z buffer Z data in the secondary Z buffer 364. 
224. The primary color buffer 222 and primary Z buffer 224 ™ The primary processor 310 processes a first portion of the 
correspond to processing operations for the frame that image data for the frame and stores color data in the primary 
results in the display signal 218 . Thus, the display driver 216 co i or buffer 322 and Z data in the primary Z buffer 324. Note 
will retrieve the color data required to generate the display that the processing operations performed by the primary and 
signal 218 from the primary color buffer 222, secondary processors 310 and 350 preferably include three- 
As was the case with the processing circuit of FIG. 1, the 15 dimensional graphics processing operations such that each 
secondary memory 260 includes a secondary color buffer includes a three-dimensional rendering graphics processing 
262 and a secondary Z buffer 264 that correspond to the pipeline. 

frame used for generation of the display signal 218. Thus, [ n the embodiment illustrated in FIG. 3, the display signal 

the primary processor 210 or the CPU 180 generates data 318 is generated by a display driver 316 that is included 

transfer indications that are relayed to the secondary pro- 20 within the primary processor 310. Thus, the inclusion of the 

cessor 250 such that color data stored within the secondary display driver 316, which may include a digital to analog 

color buffer 262 is transferred to the primary color buffer converter, in the primary processor 310 enables the advan- 

222 for use in generating the display signal 218. As was the tages of including the display driver within the processing 

case with the circuit of FIG. 1, the transfer of data may circuit to be realized. However, the display driver 316 is 

utilize DMA transfers via the DMA controllers 214 and 254 25 configured such that it is able to generate the display signal 

that may be intercoupled via a DMA bus 230. 318 by retrieving color data from both the primary color 

The secondary memory 260 is also shown to include a buffer 322 and the secondary color buffer 362. Thus, the 

third color buffer 266 and a third Z buffer 268. The third display driver 316 is able to directly access both of the color 

color buffer 266 and the third Z buffer 268 correspond to the 3Q buffers 322 and 362 to get the color data it requires to 

frame used for generation of the display signal 258. Thus, generate the display signal 318. This eliminates the need for 

the display driver 256 will generate the display signal 258 transferring the data from the secondary memory 360 to the 

based on color data stored within the third color buffer 266. primary memory 320 as was required in the circuit of FIG. 

In order for the primary processor 210 to assist the second- 1. 

ary processor 250 in processing graphics information for the 35 Because the color data used to generate the display signal 

frame used to produce the display signal 258, the primary 318 is fetched from both the primary color buffer 322 and 

memory 220 includes a fourth color buffer 226 and a fourth the secondary color buffer 362, the size of the primary color 

Z buffer 228. The fourth color buffer 226 and the fourth Z buffer 322 need only be such that a pixel location for each 

buffer 228 function in a similar manner for the display signal pixel within the first portion of the frame be included in the 

258 as the secondary color buffer 262 and the secondary Z 4Q primary color buffer 322. Similarly, the secondary color 

buffer 264 function for the display signal 218. Similar DMA buffer 362 will only include pixel locations for those pixels 

transfers to those used to move data from the secondary included in the second portion of the frame. Note that the 

memory 260 to the primary memory 220 can be employed primary and secondary color buffers 322 and 362 may lie on 

in the opposite direction to enable data to be moved from the a common bus that is coupled to the primary processor 310 

primary memory 220 the secondary memory 260. 45 such that the addresses asserted on the bus to retrieve data 

In an embodiment such as that illustrated in FIG. 2, the from the color buffers select which of the color buffers will 

dynamic division of the processing responsibilities for the return the required color data. 

different frames may become more relevant to maximizing FIG. 4 illustrates a flow diagram of a method for graphics 

the available processing power of the two processors. If one processing, where the method illustrated in FIG. 4 includes 

processor is required to process a very complicated image, 50 some of the general functionality of the circuitry included in 

it can offload some of the processing responsibilities onto FIG. 3. The method begins at step 402 where image data for 

the other processor, where the other processor may be a first portion of a frame is processed using a first processor, 

processing a much simpler image and thus have bandwidth Preferably, the processing operations performed include 

to spare. Thus, the variance in the requirements for process- three-dimensional processing operations such as three- 

ing bandwidth can be spread amongst multiple processors, 55 dimensional rendering. The processing operations per- 

ensuring that the needs of each display are adequately met. formed by the first processor store color data in a primary 

FIG. 3 illustrates another embodiment of a graphics color buffer and Z data in a primary Z buffer. The primary 

processing circuit 300 that allows the processing bandwidth color buffer and the primary Z buffer are preferably included 

requirements for a particular frame to be divided amongst in a primary memory structure. 

multiple processors. The graphics processing circuit 300 of 60 At step 404, image data for a second portion of the frame 

FIG. 3 is shown to include a primary memory 320, a is processed using a second processor. The processing 

secondary memory 360, a primary processor 310, and a operations performed by the second processor are preferably 

secondary processor 350. As was the case with the circuit similar to those of the first processor, but directed toward a 

100 of FIG. 1, the primary and secondary processors 310 and different portion of the frame. Preferably, the combination of 

350 may be coupled to a central processing unit 180 that 65 the first and second portions of the frame equates to the 

executes software stored in the memory 190. The intercon- frame in its entirety. The processing operations performed 

nection between the processors and the central processing by the second processor store color data in a secondary color 
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buffer and Z data in a secondary Z buffer. The secondary 
color buffer and the secondary Z buffer are included in a 
secondary memory structure. 

Once processing for a particular frame has completed, a 
display signal is generated at step 406. The display signal is 5 
generated using a display driver that is included in the first 
processor, where the display driver generates the display 
signal from color data stored in the primary color buffer and 
the secondary color buffer. Thus, the display driver is able to 
retrieve color data for generating the display signal from 10 
both the primary and secondary color buffers. Note that the 
primary processor is preferably implemented as an inte- 
grated circuit that includes the display driver such that the 
advantages of including the display driver within the inte- 
grated circuit of the graphics processor are realized. 

FIG. 5 illustrates a flow diagram of another method for 
graphics processing, where the flow diagram of FIG. 5 
corresponds to the general methodology associated with the 
circuitry illustrated in FIG. 1. The method begins at step 502 
where image data for first portion of a frame is processed 2Q 
using a first processor. Step 502 is substantially similar to 
step 402 of FIG. 4 in that the type of processing is the same, 
and a primary color buffer and primary Z buffer included in 
a primary memory are utilized in the processing. 
Correspondingly, step 504 is similar to step 404 in that step 25 
504 includes processing image data for a second portion of 
the frame using a second processor that stores color data in 
a secondary color buffer and Z data in a secondary Z buffer. 

At step 506, a data transfer indication is generated that is 
provided to the second processor. Preferably, the generation 30 
of the data transfer indication is based on a vertical blanking 
interval or other portion of the display signal such that the 
data transfer can occur in an efficient manner that minimizes 
any impact on generation of a display signal. Upon receipt 
of the data transfer indication, the second processor transfers 35 
color data stored in the secondary color buffer to the primary 
color buffer. Such data transfer may occur as illustrated at 
step 508 such that the transfer is accomplished via DMA 
operations. 

Finally, at step 510, a display signal is generated from 40 
color data stored in the primary color buffer. In the method 
illustrated in FIG. 5, all of the relevant color data needed for 
generating the display signal has been stored or relocated to 
the primary color buffer such that the display signal can be 
generated from color data fetched exclusively from the 45 
primary color buffer. The primary color buffer includes color 
data generated by both the first and second processors, thus 
allowing the parallel processing operations of the processors 
to jointly render the image data for the frame. 

By enabling multiple graphics processors to cooperate in 50 
performing the graphics processing operations required for 
generating image data for a particular frame, the processing 
bandwidth in a graphics processing system can be increased. 
By allowing each of the processors to utilize local memory 
in performing the processing operations that may require 55 
multiple memory accesses, the efficiency with which the 
processing can be performed is increased. Once the image 
data for the frame has been generated, the portions generated 
by each processor can be combined into a single memory 
structure such that a display driver can easily fetch the color 60 
data it requires to generate a display signal. In other 
embodiments, the display driver is capable of fetching the 
color data it requires from multiple memory structures such 
that the transference of the color data into a single memory 
structure is not required. 65 

It should be understood that the implementation of other 
variations and modifications of the invention in its various 
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aspects will be apparent to those of ordinary skill in the art, 
and that the invention is not limited to the specific embodi- 
ments described. It is therefore contemplated to cover by the 
present invention any and all modifications, variations, or 
equivalents that fall within the spirit and scope of the basic 
underlying principles disclosed and claimed herein. 
What is claimed is: 

1. A graphics processing circuit, comprising: 

a primary memory that includes a primary color buffer 
and a primary Z buffer; 

a secondary memory that includes a secondary color 
buffer and a secondary Z buffer; 

a primary processor operably coupled to the primary 
memory, wherein the primary processor processes a 
first portion of image data for a frame, wherein pro- 
cessing the first portion stores color data for the first 
portion in the primary color buffer and Z data for the 
first portion in the primary Z buffer, wherein the 
primary processor includes a display driver, wherein 
the display driver generates a display signal from color 
data stored in the primary color buffer, and 

a secondary processor operably coupled to the secondary 
memory and the primary processor, wherein the sec- 
ondary processor processes a second portion of the 
image data for the frame, wherein processing the sec- 
ond portion stores color data for the second portion in 
the secondary color buffer and Z data for the second 
portion in the secondary Z buffer, wherein the second- 
ary processor receives a data transfer indication from 
the primary processor, wherein the secondary processor 
transfers color data for the second portion from the 
secondary color buffer to the primary color buffer in the 
primary memory based on the data transfer indication. 

2. The graphics processing circuit of claim 1 wherein the 
data transfer indication is relayed using a signal line oper- 
ably coupled to the primary and secondary processors. 

3. The graphics processing circuit of claim 1 further 
comprises a shared bus structure operably coupled to the 
primary and secondary processors, wherein the shared bus 
structure couples the primary and secondary processors to a 
central processing unit. 

4. The graphics processing circuit of claim 3 wherein the 
central processing unit generates the data transfer indication. 

5. The graphics processing circuit of claim 3 wherein the 
data transfer indication is relayed using the shared bus 
structure. 

6. The graphics processing circuit of claim 3 further 
comprises a bridge operably coupled to the primary and 
secondary processors, wherein the bridge couples the pri- 
mary and secondary processors to the shared bus structure. 

7. The graphics processing circuit of claim 1 wherein the 
primary color buffer includes a corresponding pixel location 
for each pixel included in the frame, wherein the primary Z 
buffer includes a corresponding pixel location for each pixel 
included in the first portion, and wherein the secondary color 
buffer and the secondary Z buffer each include a correspond- 
ing pixel location for each pixel included in the second 
portion. 

8. The graphics processing circuit of claim 1 wherein 
primary and secondary processors each include a direct 
memory access (DMA) controller, wherein the DMA con- 
trollers are intercoupled to facilitate transfer of color data. 

9. The graphics processing circuit of claim 8 further 
comprises a DMA bus operably coupled to the DMA con- 
trollers of the primary and secondary processors, wherein 
the DMA bus includes at least one data signal and a clock 
signal. 
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10. The graphics processing circuit of claim 9 wherein the 
primary processor generates the data transfer indication 
based on a vertical blanking interval corresponding to the 
display signal. 

11. The graphics processing circuit of claim 1 wherein 5 
each of primary and secondary processors include a three- 
dimensional graphics rendering pipeline. 

12. The graphics processing circuit of claim 1 wherein the 
secondary processor includes a DMA controller that trans- 
fers the color data for the second portion to the primary color 
buffer via a shared bus structure that intercouples the pri- 
mary and secondary processors, wherein the shared bus 
structure couples the primary and secondary processors to a 
central processing unit, and wherein the DMA controller 
generates source addresses corresponding to the secondary 
memory and destination addresses corresponding to the 15 
primary memory. 

13. The graphics processing circuit of claim 1 wherein the 
first and second portions of the image data for the frame are 
dynamically determined. 

14. The graphics processing circuit of claim 1, wherein: 20 
the secondary memory includes a third color buffer and a 

third Z buffer; 

the primary memory includes a fourth color buffer and a 
fourth Z buffer; 25 

the secondary processor processes a first portion of image 
data for a second frame, wherein processing the first 
portion of image data for the second frame stores color 
data in the third color buffer and Z data in the third Z 
buffer, wherein the secondary processor includes a 30 
second display driver that generates a second splay 
signal from color data stored in the third color buffer; 
and 

the primary processor processes a second portion of the 
image data for the second frame, wherein processing 35 
the second portion of image data for the second frame 
stores color data in the fourth color buffer and Z data in 
the fourth Z buffer, wherein the primary processor 
receives a second data transfer indication from the 
secondary processor, wherein the primary processor 40 
transfers color data for the second portion of the image 
data for the second frame from the fourth color buffer 
to the third color buffer based on the second data 
transfer indication. 

15. The graphics processing circuit of claim 1 wherein the 45 
primary memory includes a first color buffer and a second 
color buffer, wherein the color data is transferred from the 
secondary memory to the first color buffer that is acting as 
the primary color buffer when the second color buffer is used 

as a display color buffer, wherein the color data is transferred 50 
from the secondary memory to the second color buffer that 
is acting as the primary color buffer when the first color 
buffer is used as the display color buffer, wherein the display 
driver generates the display signal from color data stored in 
the display color buffer. 55 

16. A graphics processing circuit, comprising: 

a primary memory that includes a primary color buffer 
and a primary Z buffer, wherein the primary color 
buffer includes a pixel location corresponding to each 
pixel of a frame, wherein the primary Z buffer includes 60 
a pixel location for each pixel of a first portion of the 
frame; 

a secondary memory that includes a secondary color 
buffer and a secondary Z buffer, wherein the secondary 
color buffer and the secondary Z buffer each include a 65 
pixel location for each pixel of a second portion of the 
frame; 



a primary graphics processor operably coupled to the 
primary memory, wherein the primary graphics proces- 
sor processes image data corresponding to the first 
portion of the frame, wherein processing for the first 
portion stores color data for the first portion in the 
primary color buffer and Z data for the first portion in 
the primary Z buffer, wherein the primary graphics 
processor includes a display driver, wherein the display 
driver generates a display signal from color data stored 
in the primary color buffer, wherein the primary graph- 
ics processor includes a primary DMA controller that 
generates addresses corresponding to the primary color 
buffer for DMA transfers to the primary color buffer; 
and 

a secondary graphics processor operably coupled to the 
secondary memory and the primary graphics processor, 
wherein the secondary graphics processor processes a 
second portion of the image data for the frame, wherein 
processing the second portion stores color data for the 
second portion in the secondary color buffer and Z data 
for the second portion in the secondary Z buffer, 
wherein the secondary graphics processor receives a 
data transfer indication from the primary graphics 
processor, wherein the secondary graphics processor 
transfers color data for the second portion from the 
secondary color buffer to the primary color buffer in the 
primary memory based on the data transfer indication, 
wherein the secondary graphics processor includes a 
secondary DMA controller that generates addresses 
corresponding to the secondary color buffer for DMA 
transfers to the primary color buffer. 

17. The graphics processing circuit of claim 16, wherein 
the primary DMA controller of the primary graphics pro- 
cessor is operably coupled to the secondary DMA controller 
of the secondary graphics processor via a bus structure that 
includes a plurality of data signals and a clock signal. 

18. The graphics processing circuit of claim 17, wherein 
each of the primary and secondary graphics processors 
include at least a portion of a three-dimensional graphics 
rendering pipeline, wherein each of the primary and sec- 
ondary graphics processors perform three-dimensional pro- 
cessing operations on image data for the frame. 

19. The graphics processing circuit of claim 18, wherein 
the primary graphics processor further comprises a control 
block, wherein the control block generates the data transfer 
indication to correspond to a vertical blanking interval of the 
display signal. 

20. The graphics processing circuit of claim 19, wherein 
at least one of the primary and secondary memories store 
texture data used in processing image data. 

21. A graphics processing circuit, comprising: 

a primary memory that includes a primary color buffer 
and a primary Z buffer; 

a secondary memory that includes a secondary color 
buffer and a secondary Z buffer; 

a primary processor operably coupled to the primary 
memory, wherein the primary processor processes a 
first portion of image data for a frame, wherein pro- 
cessing the first portion stores color data for the first 
portion in the primary color buffer and Z data for the 
first portion in the primary Z buffer, wherein the 
primary processor includes a display driver, wherein 
the display driver generates a display signal from color 
data stored in the primary color buffer and the second- 
ary color buffer; and 

a secondary processor operably coupled to the secondary 
memory and the primary processor, wherein the sec- 



02/02/2004, EAST Version: 1.4.1 



US 6,473,1 

13 

ondary processor processes a second portion of the 
image data for the frame, wherein processing the sec- 
ond portion stores color data for the second portion in 
the secondary color buffer and Z data for the second 
portion in the secondary Z buffer. s 

22. A method for graphics processing, comprising: 
processing image data for a first portion of a frame using 

a first processor, wherein processing operations per- 
formed by the first processor store color data in a 
primary color buffer and Z data in a primary Z buffer, 10 
wherein the primary color buffer and the primary Z 
buffer are included in a primary memory; 
processing image data for a second portion of the frame 
using a second processor, wherein processing opera- 
tions performed by the second processor store color 15 
data in a secondary color buffer and Z data in a 
secondary Z buffer, wherein the secondary color buffer 
and the secondary Z buffer are included in a secondary 
memory; and 

generating a display signal using a display driver included 
in the first processor, wherein the display driver gen- 
erates the display signal from color data stored in the 
primary color buffer and the secondary color buffer. 

23. A method for graphics processing, comprising: ^ 
processing image data for a first portion of a frame using 

a first processor, wherein processing operations per- 
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formed by the first processor store color data in a 
primary color buffer and a primary Z buffer, wherein 
the primary color buffer and the primary Z buffer are 
included in a primary memory; 
processing image data for a second portion of the frame 
using a second processor, wherein processing opera- 
tions performed by the second processor store color 
data in a secondary color buffer and Z data in a 
secondary Z buffer, wherein the secondary color buffer 
and the secondary Z buffer are included in a secondary 
memory; 

generating a data transfer indication that is provided to the 
second processor, wherein upon receipt of the data 
transfer indication, the second processor transfers color 
data stored in the secondary color buffer to the primary 
color buffer; and 

generating a display signal from color data stored in the 
primary color buffer, wherein the color data in the 
primary color buffer includes color data generated by 
the first and second processors. 

24. The method of claim 23, wherein the second processor 
transfers color data to the primary color buffer using a direct 
memory access (DMA) transfer. 

***** 
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